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1. Introduction 

Arguably the most important open issue in statistical mechanics is the quest to 
understand the role of entropy and entropy production in systems far from equilibrium. 
In equilibrium situations, diffusion usually smoothens out gradients such that one 
would not expect interesting structure. Out of equilibrium, the situation is different 
and it is usually crucial to consider (dynamics on) multiple scales. This goes beyond 
the classical understanding of entropy as a thermodynamic state variable. 

Since computer simulations have become feasible, scientists have been trying 
to model and understand the microscopic dynamics of systems in contact with 
thermodynamic environments. In such simulations the environment is usually 
modelled by modified equations of motion, so-called thermostats [3D]. Thermostats 
can usually be divided into deterministic and stochastic thermostats. 

Soon after their introduction, dynamical systems theory got interested in 
deterministic thermostats. Phase space contraction in such systems was understood 
to be related to physical entropy production. Also, the first fluctuation relations for 
nonequilibrium systems arose in that context [ini [H] . 

Stochastic thermostats have a longer history and date back to the works of 
Einstein, Smoluchowski and Langevin in the early 20th century. Hill [T7] and later 
Schnakenberg [31j connected stochastic dynamics with dissipation in small systems. 
Since the late 90s, fluctuation relations have also been found for stochastic dynamics 

[511 [21 HI [33]. 

Fluctuation relations are concerned with the probability of rare events that seem 
to act contrary the 2nd law of thermodynamics. Whereas this is a phenomenon that 
cannot be observed in the macroscopic world, such fluctuations are quite common 
in the microscopic world of macromolecules in solution. The latter have become the 
paradigm of what today is called stochastic thermodynamics. Because of the advent 
of modern measurement and simulation techniques, much research is done on this 
interface between physics, chemistry and biology. 

For instance, the work relations by Jarzynski and Crooks, which allowed an 
experimental measurement of the free energy landscape of macromolecules, have been 
of huge impact [TH[ [^. While the former was originally derived in the context of 
dynamical systems, the latter was found for stochastic dynamics. Today, they are 
both understood as consequences of the above mentioned fluctuation theorems. 

Besides others [37|, this emergence is just one hint of a deeper connection of 
the entropy concepts in dynamical systems theory and stochastic dynamics. The 
reason for this is the intimate connection of entropy with Shannon's information theory 
[3S1 [ini [Ml [IS] ■ However, this is not generally acknowledged and entropy, but even 
more so, entropy production often seem to be an almost emotional topic with many 
different points of view. Such views range from restricting entropy to the realm of 
classical thermodynamics to promoting variation principles for entropy production as 
the general mechanism for nonequilibrium steady states. Some people only allow the 
phase space contraction in thermostatted dynamics to be connected with dissipation, 
other critize the phenomenological equations of motion as having no connection to the 
true microscopic dynamics. 

In this paper, we follow earlier works [S] [3H1 [211 [231 [THj and try to highlight 
the connections of dynamical systems, statistical mechanics and information theory. 
In order to connect the fields beyond pure mathematical considerations, we have to 
discuss some subtle questions: 
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• How do the crucial assumptions of (stochastic) thermodynamics rely on a notion 
of separation of time- and length-scales? 

• How is the information which is contained in a measurement affected by these 
scales? 

• How does thermodynamic entropy arise in the context of information theory? 

Finally, this will enable us to derive the modern notion of stochastic thermodynamics 
|35] . which is based on entropic trajectory functionals that can be obtained 
from deterministic equations of motion. Throughout the paper we will motivate 
interpretations directly at the level of definitions, to avoid possibly misleading a- 
posteriori interpretations that could lead to inconsistencies. 

The work is organized as follows: Section [2] reviews basic notions of dynamical 
systems theory with a focus on Hamiltonian systems. In section [3] we introduce 
stochastic equations of motion and briefly review Seifert's approach to stochastic 
thermodynamics . Section 3] motivated variants of thermostatted equations of 
motion to provide a framework for further discussions |30| . The main part of 
the work is section [5l There, we formulate the crucial assumptions of stochastic 
thermodynamics in the context of determinisic dynamics. Under these assumptions, 
we are able to construct exact correspondences of the stochastic entropy functionals. 
This is achieved by applying Shannon's notion of information to two different densities, 
reflecting the (unobservable) microscopic probability densities and the (observable) 
measured densities, respectively. 



2. Microscopic Basics 
2.1. Hamiltonian dynamics 

We will start with deterministic Hamiltonian dynamics on a phase space F. A point 
X = {q,p) & ^ represents the state of a TV-particle system in d-dimensional physical 
space consisting of the coordinates q and p of all particles Q For brevity, here and in 
the following we use the short notation q = {qk}^=ij P = {PfclfcLi when no ambiguity 
can arise. The equations of motion 

OH p . dH 

q=^ = —, P= = -dqV{q), (1) 

op m oq 

are governed by the Hamiltonian 

H[x)=V{q) + Y,^. (2) 

In our notation the term ^p^/27n in Eq. is short for X^feLi ^7)7 including the 
(possibly different) masses m^. 

For macroscopic physical systems the number of particles, N ^ O (lO^^), is very 
large. Even for much smaller, mesoscopic[| systems this number is still large, mostly 
because the system of interest is not placed in vacuum but in an environment consisting 
of many particles. 

X To denote vectors in physical space, we will use bold symbols like q. Elements of more general 
multi-dimensional sets will not receive any special decoration. 

§ By mesoscopic we mean the range of scales between typical molecular scales (lA = 10~^"m) to 
typical macroscopic scales (10""^ - lO^m) 
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2.1 Hamiltonian dynamics 



This renders the microscopic equations of motion ([T]) unfeasible for simulating 
meso- or macroscopic systems in thermodynamic environments. Even with state-of- 
the-art supercomputers, simulations of no more than a few (10^-10®) particles on 
small time scales (10^-lO^ns) are possible. Hence, finding effective dynamical models 
for fewer degrees of freedom is one of the main interests of modern (statistical) physics. 

Another problem with using the equations ([T]) is that they need to be augmented 
with microscopic initial conditions. Such detailed information is never accurately 
available for real complex systems. Together with numerical inaccuracies, this puts a 
perspective on the value of a single microscopic trajectory. 

To overcome these problems, statistical mechanics introduces the notion of 
ensembles. Mathematically, an ensemble is a probability measure on phase space 
r . Throughout this work we assume that an ensemble can be specified by a (possibly 
time-dependent) phase space density f?^*-* (x) Ijj] Ensembles are usually taken to reflect 
the probability of finding a certain microstate when only a macro- or mesoscopic state 
can be specified due to the finite measurement resolution. 

Because of the lack of experimental accessibility, one is not interested in the 
microscopic state x but rather in the current value of some measurable observable 
A . In classical thermodynamics, these observables where usually macroscopic (bulk) 
properties of largely homogeneous systems. Nowadays, due to a vast improvement in 
measurement techniques, the measurements are taken on mesoscopic scales that lie in 
between the molecular (microscopic) and the bulk (macroscopic) scale. The qualitative 
difference between meso- and macroscopic scales is that on the former fluctuations 
can be observed and may play an important role, whereas for the latter they can be 
neglected. In what follows, we will always think of an observable being defined on 
mesoscopic scales though the discussion remains valid for macroscopic observables. 

Mathematically, an observable is a mapping 

A:r^R, 
X ^ A{x). 

A measurement M consists of measuring an number of observables Ai . We summarize 
the outcome of that measurement in an abstract (possibly multi-dimensional) 
mesoscopic observable uj . The measurement A4 is a surjective mapping from phase 
space r to an abstract space of observations, il: 



X 1-^ U!(X). 

The ensemble average of an observable A is 

{Af^ = / g'^'\x)A{x)dx. (5) 



An empirical sampling of the distribution of the values of A is done by measuring 
this observable for many experiments or simulations. 

For real experiments, already the single measurement of an observable implicitly 
features a time average over some finite observation time Tobs^ : 

= — r^'°"^(^W)di. (6) 

'''obs Jt 

II This excludes, for instance, tlie so-called SRB-measures 1261 that appear asymptotically in the 
context of non-Hamiltonian dynamics, as described below. 
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2.2 Ergodicity and time-sampling of phase space 



One usually assumes that there is a separation of time scales ta ^ Tobs ^ ''mic between 
the time-scales ta of the evolution of the observable and an unobservable microscopic 
time scale Tmic associated with the dynamics in phase space. 

Given that, one can choose Tobs such that A(t) is effectively independent of this 
scale. 

2.2. Ergodicity and time-sampling of phase space 

Equilibrium thermodynamics relies on ergodicity, which is a crucial assumption relating 
time- and ensemble averages. The ergodic hypothesis assumes that there exists a 
stationary measure g°° for physical many-particle systems such that 

lini — / ^ ""^ A(a;(t)) di =: / g°°{x)A{x)dx (7) 

Toba^OO Tobs Jt Jr 

exists for any observable A and is independent of the time t when one starts the time- 
average (no aging). In other words, the ergodic hypothesis states that there exists an 
ensemble that samples phase space in the same way as an infinitely long trajectory 
would do asymptotically. Proving the ergodic hypothesis for actual systems is usually 
very difficult. However, it constitutes the dynamical microscopical approach to 
equilibrium thermodynamics combining statistical mechanics with dynamical systems 
theory. 

2.3. Equilibrium and nonequilibrium ensembles 

In contrast to the situation above, modern statistical mechanics seeks to understand 
the foundations of nonequilibrium systems. Physical nonequilibrium systems are 
always in a time-dependent transient state which may or may not relax towards an 
equilibrium on accessible time scales. Therefore for nonequlibrium situations it is not 
enough to consider an asymptotic density 

The dynamical behaviour of the density p^*-* is governed by the flow 

: r X F, 

(8) 

x{t) i-> x{t + t) 

generated by Eq. ^ . 

If the flow of a Hamiltonian system on the hypersurface defined by H{x) = 
Eq = H{x{0)) is ergodic, it always yields the so-called microcanonical distribution for 
constant energy i?o, 

g°°{x) (X xh{x)=Eo, (9) 

where x denotes the characteristic function. 

However, in experiments one specifies the temperature T of the environment 
(heat bath) rather than the total energy. Also, physical observables should depend 
only on the state of the system and not on the state of its environment. From classical 
statistical physics we would expect a (Maxwell-)Boltzmann distribution for the degrees 
of freedom of the system. One is inclined to ask for the "right" reduced equations of 
motion in a reduced phase space Tsys consisting of the degrees of freedom of the 
system only. Because we ignore the details of the interaction with the environment, a 
trajectory produced by such reduced dynamics does certainly not describe the actual 
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microscopic motion. However, as we argued above, single trajectories or instantaneous 
configuration do not really matter. The important thing is that the new dynamics 
generates trajectories that sample the phase space in accordance with macroscopic 
constraints. In other words, q°° must be the (equilibrium) distribution expected 
from the ensemble belonging to the macroscopic parameters of the environment (like 
temperature, pressure etc.)l3ll 

But since Hamiltonian dynamics will only generate microcanonical distributions, 
the reduced dynamics on T sya will be in general non-Hamiltonian. In the following 
two sections we review two different approaches to such reduced dynamics. 



3. Stochastic Thermodynamics 

Stochastic thermodynamics is a modern paradigm for the treatment of (small) 
systems in a thermodynamic environment. It is based on the notion of stochastic 
differential equations (Langevin equations) or the corresponding Fokker-Planck or 
path integral descriptions pRJ . Physically, the stochastic term that appears in 
the equations of motion models the interaction of the heat bath with the system. In 
the following paragraphs we review the basic concepts of stochastic thermodynamics. 
The presentations closely follows Seifert's work [33l [34] (for an extensive review see 
Ref. |35| ) though notation may vary. 

3.1. Langevin equation 

The stochastic evolution of the system on the reduced phase space Fgyg can be 
described by a Langevin equation: 

g = (lOa) 
m 



p = -dqV{q)-^p+^2D;i. (10b) 

Eqs. ([TO)) resemble the Hamiltonian equations of motion ([T|) with two additional 
(force) terms for the change of momenta. The first additional term phenomenologically 
models the solvent friction with drag coefficient 7. The second term is the stochastic 
force ^ with strength characterized by Dp. The statistics of the force ^ are those of 
white noise, 

(C(t))=0, {Utmt'))^5^3&(i-t')^ (11) 

where denotes the ith component of ^, 5ij is the Kronecker-symbol and 5{t — t') is 
the Dirac 5-distribution. The (5-distribution for the correlations is an approximation 
to the real collision statistics that requires that the time scale of observations Tobs is 
much larger than the typical time of a microscopic collision, Tauc- For the momenta 
to follow a Maxwell-Boltzmann distribution in the steady-state. Dp has to obey the 
fluctuation-dissipation relation 

Dp - ^kBT, (12) 
where T is the temperature of the bath and is Boltzmann's constant. 

^ That this trajectory also samples the transient regime equally well is usually tacitly assumed, but 
has no deeper justification. 



7 



3.2 Smoluchowski equation 



The evolution of the probabiHty density g'^^^x) of finding the system at time t 
at phase space point x € ^sys is governed by the Fokker-Planck (or in this case, the 
so-caUed Kramers-Klein) equation 

dtg^'Hx) = ~Y.[^i3^ + ^p^p\ (13) 

with current densities 

(14) 

Note that we use the notation introduced in Eq. ([T]) in section O 



^ 777, 



3v = 



idqV)--/- + jkBTdp 

m 



3.2. Smoluchowski equation 

Often it is not feasible or necessary to explicitly consider the momenta p. On average, 
the momenta reach the value (p) = —^dqV^ on a time scale Tp := ^. If the 
system's particles are atoms or small molecules, Tp ~ Tmic- If the system's particles 
are already mesoscopic objects and a (hydrodynamic) radius R can be defined, Stokes 
friction yields Tp cx /i^^i?^ with the dynamic viscosity of the environment fj,. For 
instance, for colloidal silica of a couple of hundreds of nanometers in diameter one has 
Tp « 10~"'^'^s which is well below usual observable time scale Tobs- 

In general, if Tobs 3> Tp the momenta are always relaxed to their equilibrium 
values and one can consider an overdamped dynamics using the Smoluchowski equation 
generating dynamics on the space of configurations Fconf: 

q = fidgV + V2D^. (15) 

Here, /i = i is the mobility and D is the diffusion constant. Using either the FDR 



(|12p or demanding the steady-state distribution to be of Boltzmann form yields the 
so-called Einstein-Smoluchowski relation: 

To find a typical time-scale Tq for the overdamped dynamics psp suppose that the 
system is close to its equilibrium condition and therefore a harmonic approximation 
for the potential energy holds, i.e. V{q) ~ ^q^ with some spring constant k which is of 
the order of the typical interaction energy (for biological system a few k^T « 10^^'^ J) 
divided by the square of the typical length-scale of the interaction (typically on the 
order of nanometers). Then the force is linear in q yielding a time scale Tq — j. Again, 

using Stokes friction one finds for typical values ^ ~ ^^-^ ^ 1. This justifies that we 
neglect the dynamics of the momenta in favour of the dynamics of the coordinates. 

However, in typical single-molecule experiments on proteins or DNA Tobs 3> Tq. In 
addition, spatial resolution is usually not good enough to resolve the dynamics of single 
coordinate degrees of freedom. This is why one might be interested in much slower, 
collective dynamics happening on larger time- and length-scales which are accessible 
to experimental observation. One example for this further separation of scales are 
configurational changes or folding in proteins occurring on time scales Tconf ~ lO"**- 
10"s. 
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3.3 Stochastic dynamics of mesoscopic observables 



3.3. Stochastic dynamics of mesoscopic observables 

If the detail of observation on a system is restricted to certain meso- or macroscopic 
observables M{x) — Ai{q), it is natural to ask for a (stochastic) description only 
involving the possible values ui G that these observables can take. As every 
configuration q corresponds to exactly one observable state u — Ai{x) G il, the 
allowed values uj then partition Fgys into disjoint classes (cells) 

C^:^{xeT,ys\Mix)^iu}. (17) 

We will distinguish two different situations. 



Continuous observable Firstly, let M be some continuous function of x. The 
classes C are then hypersurfaces in Pgys. Often, in that case we can write down 
a phenomenological coupled Langevin equation for the stochastic variable oj: 

cO = /i[-(VF(^)) + /] + C. (18) 



If u! is multi-dimensional, the mobility tensor /i couples the different components of lo 

and C is generalized white noise with correlation matrix {C{t)C{t')) — Ik-oTjiS^t — t'). 

F(uj) is a phenomenological potential (which should be interpreted as a free energy, 
see below) and / is a non-conservative force. 

The non-conservative force can arise through the projection of Tsys onto if A4(x) 
is a non-monotonous function yielding classes C^j which are not simply connected. For 
instance, consider a description where M{x) = M{q) is periodic in q, i.e. where the 
effective dynamics is done using periodic boundary conditions to model an infinite 
system with an external force Fpot = —W. In this case, the force governing the 
dynamics of ui turns into a non-conservative force / acting on a torus. 

The Fokker-Planck equation for Eq. (fT^ is 

dtP^*'^ = -Vj = -V (^^h(VF(c^)) + / - A;bTV] . (19) 

Discrete observable Another possibility is that we already start with a disjoint 
discrete partition {Ci}f^Li and assign values uji to each of its elements, i.e. 

M 

M{x)=Y,[ujacA^)]- (20) 

1=1 

The classes Cuj^ = Ci are identical with the elements of the partition. 

In this case the discrete dynamical trajectories ui(t) are created by a Markov jump 
process rather than by a Langevin equation. The probabilities p^^ of being in state i 
at time t evolves according to the Master equation, which is a discrete version of the 
Fokker-Planck equation: 



M 

a (*) r j (*) i (t) 

3=1 



(21) 
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3.4 Local equilibrium and intrinsic entropy of observable mesoscopic states 



We assume that the jump rates Wj > for i ^ j and = — J2j time 
independent. As the original physical dynamics is time-reversible, each trajectory 
that leads the system from Ci to Cj must also have an allowed reverse trajectory. This 
condition of dynamical reversibility can only be fulfilled if jumps can always happen 
in both directions, i.e. > <^ u;^ > 0. 

One can also consider the jump dynamics as a random walk on a graph with M 
nodes with edges {i,j) if and only if Wj > 0. This graph can have arbitrary topology 
including cycles, which are the analogue of the non-Euclidean {e.g. toroidal) geometries 
for the continuous case. 

3.4- Local equilibrium and intrinsic entropy of observable mesoscopic states 

We just saw how a further separation of time-scales between Tg and the typical 
time-scales of observable collective motion Tconf > Tobs lead to effective mesoscopic 
descriptions of Eq. and Eq. (PT|) . Physically, to justify the Markovian stochastic 
dynamics on the level of the mesoscopic cells, one has to assume equilibrated cells. This 
means that the dynamics happening within a cell reach a constrained equilibrium, 
which is, by definition, memoryless. Local equilibrium is the key concept for the 
consistency of coarse-grained descriptions [HIT^IIH]- Following Seifert [31] we introduce 
the notions necessary in the present context: 

Let A4 describe some measurable observable that is to be treated within the 
framework of stochastic thermodynamics. We assume that all (coordinate) niicrostates 
q in any cell defined by this observable equilibrate on time-scales Tg that are much 
shorter than the time-scales Tconf of transitions between mesoscopic states. With 
/3 = (fefiT')^^, the conditioned microscopic distribution g{q\uj) of finding microstate q 
given mesoscopic state lo reads 

giq\L,) = exp (-/3(y(q) - F{u)) , (22) 

where V{q) is the microscopic potential on Fconf and 

F{uj) := - fcBTlog [exp {-fSV (g))] j (23) 

is the free energy of state lu. F ~ e — Tsi^t can be split into the constrained internal 
energy 

and the intrinsic entropy of state w, 

Sint(w) = -ftB [g{q\uj) log g{q\uj)] . (25) 

q&C^ 

Here, we formulated everything only with coordinates q. In underdamped cases, 
usually a Maxwell-distribution for the momenta p is assumed, which corresponds to 
a fast equilibration of the momenta. The difference that would occur both in the 
intrinsic entropy and free energy is then only an (unobservable) constant. For more 
mathematically rigorous statements on the general issue of considering entropies of 
reduced descriptions see Ref. [24] . 
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3.5 Mesoscopic trajectories and trajectory averages 





(27) 



3.5. Mesoscopic trajectories and trajectory averages 

The goal of stochastic thermodynamics is to formulate sensible thermodynamic 
balance equations on the level of stochastic trajectories w. In this context we will 
consider mesoscopic trajectories that are discrete in the space of observations but 
continuous in time. A similar treatment for the case where a; is a continuous variable 
can be found in Refs. |331 [Ml 155]. 

A trajectory a; is a generated by a Markov jump process, i.e. it is a random 
variable. Almost all trajectories can be described by two countably infinite sequence: 

(26) 

In the first row the different states visited by the trajectory are listed. The second 
row contains the jump times Tj of the jth. jump occurring between states Wj-i and ujj. 
A finite trajectory w(t) runs only for times t e [0, r]. The number of jumps n = n(r) 
is a random variable. The finite trajectory uAt) can be described by two vectors of 
length n + 1: 

W„(r)-1 ^n{T) \ 
T„(r) T ) 

The last entry in the second row is the temporal length of the trajectory, t. In both 
cases, we denote the mesoscopic state at time t £ [0,t] along a trajectory by U!{t). 

The weight of a trajectory of length r on the appropriate trajectory space is 
denoted by P[w]. Henceforth, to stress a functional dependence we will use square 
brackets [•]. For instance for the formulation of fluctuation theorems, it is useful to 
split the trajectory weight into an initial and a conditional part: 

Py =pL"^-p[^I^o]. (28) 

Let A[ui; t] = A [w(t); t] be a functional of trajectories running for time t which 
can also explicitly depend on r. The time-dependent trajectory average is defined as 

((A)):=^[Pyr)]Ayr);r]]. (29) 

If A [uj_; t] — A{ljj{t);t) only depends on the final state a;(r) of the trajectory, we say 
it has a local form. Local forms obey 

((A))=5][p^^)A(c.(r);r)] = {A)'^^\ (30) 

i 

i.e. they can be expressed as trajectory averages. A special case of local forms 
are state- variables, where A{uj) denotes a state function in the sense of classical 
thermodynamics. Examples of non-local physical observables are current variables 
A[oi', t] = J2k [A{i^k-i,^k', t)S{t — Tk)] which only depend on single jumps U!k~i i^k 
that occur at time t^. For such observables, the trajectory averages reduce to current 
averages which involve summing over two states (hence the subscript 2): 

((^)) = E \p'^^Ai^,J;r)\ = (A)^^^ . (31) 
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3.6 Thermodynamic balance 



3.6. Thermodynamic balance 

Single jumps As we will see below, the entropic trajectory functionals of stochastic 
thermodynamics are all linear combinations of three fundamental functionals, which 
are either state variables, local forms or current variables. Here, implicit time- 
dependence enters through an explicit dependence on the solution of the equation for 
the evolution of the ensemble probabilities. The same ideas also apply if the transition 
rates explicitly depend on time. We start by considering the changes associated with a 
single jump from LUi to ujj happening in an infinitesimal time interval. We will denote 
the changes encountered at a jump by the symbol d for state variables, A for local 
observables and S for current variables. 

The change in internal energy is given by de — e{ujj) — e{LOi). If additionally a 
non-conservative (generalized) force / is acting an additional amount of (mechanical) 
work 6w is dissipated. The change of heat 5q — de + 5w in the medium consists of the 
(negative) change of internal energy plus the dissipated energy. Therefore, the change 
of entropy in the medium is given as 

Sq de Sw 
dSmod = 7^ " ^ + ^■^^> 

To formulate the second law we need to identify all terms corresponding to the 
change of the total entropy, 6stot, in system and environment. However, it is not 
enough to identify the sum of Jsmed and Asint = Sint('j-'j) — Sint(wi) with Sstot- If 
it were so, one can easily construct a violation of the second law where <5stot is not 



5iiy 



positive on average 

Therefore another term is needed to account for the current state of the ensemble. 
This term will be of local form but not a state variable, hence we denote it by Asvis- 
The subscript indicates that this is the visible entropy, which an experimenter could 
measure by sampling the ensemble. 

Putting everything together, we find for a jump from coi to LOj 

Sstot{i j) (5smcd(« j) + dSint (« ^ j) + Asvis(i j). (33) 

The instantaneous average rate of change of the total entropy, Stot; is obtained 
by summing over all jumps with the corresponding joint probability rates and yields 
a dynamical formulation of the second law: 

t 



:=5][pf«^;-5stot(*^j)] >0 (34) 



p,-*' is the solution of the Master equation pT|) at time t for some given initial condition. 

The only thermodynamically consistent choice for Etot that fulfils the second law 
is a form resembling a KuUback-Leibler divergence [35] ■ It has frequently been used 
in previous literature (c/. Refs. [^Tl [TB] ): 



(*) i 
p) 



> 0. (35) 



+ The argument of Ref. |34) goes as follows: Consider a system without external forces and two states 
with the same internal energy but different intrinsic entropies. An ensemble that initially consists 
only of systems in the state with the higher intrinsic entropy would be smeared out over both states 
while decreasing its entropy in the process. 
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3.6 Thermodynamic balance 



The stochastic entropy can now be identified with the ensemble-dependent part in the 
above equation: 

As,i, := -fcfi logpj*' - (-A:b logpf . (36) 
The remaining part summarizes the other two contributions: 

Ssmot = (Jsmod + dsint = log ^ =: . (37) 

wi 

To simplify notation, we introduce the quantity 

B]:=fcBlog^ (38) 
wi 

also previously defined in the early works of Hill [T^, in the context of fiuctuation 
relations for stochastic dynamics [711 HZl [Ml El E] and related works [I] [2]. We 
suggest to call (local) motance, hence also the name of the corresponding entropy, 
Smot- It is the ability of the physical system (independent of the ensemble) to 
facilitate thermodynamic motion. It usually corresponds to changes in the natural 
thermodynamic potentials of certain physical situations j34|. 

Balance along trajectories We want to extend the balance for a single jump to a 
balance involving trajectory-dependent functionals, which are an easy approach to 
the fiuctuation relations for stochastic dynamics. Instead of starting with the physical 
functionals for the entropy production in system and medium, we identify three 
fundamental functionals with distinct properties, which constitute the building blocks 
of all other functionals. The first functional is related to the intrinsic entropy, which 
is a state variable. Consequently, this functional is independent of the ensemble and 
does only depend on the end-point w(t) of a trajectory w(r): 

Sint [w; r] = Sint (w (r)) -fcfi [£'(9|w(t)) log £i(q|w(T))] . (39) 

Unfortunately, one never has direct access to Sint because the intrinsic entropy is 
not visible in the coarse-grained description. However, we can measure the ensemble- 
dependent visible entropy, which is also a local functional: 

Svis[w;T] = Svis(w(T)) = -fee log p^"!^^), (40) 

The entropy associated with the motance Ssmot cannot be written in difference 
form. It depends on the jumps along the trajectory, so the corresponding functional 
is a current variable an will depend on the whole trajectory co{t). We introduce the 
motance of a trajectory s,not, which needs to integrate all jumps from state ujk-i to 
state uJk that have occurred at times Tk < t: 

s^ot[^:T] = fcB 5] [e(T - Tfc)B--i] . (41) 

k 

Mathematically, the motance is the time-antisymmetric part of the action Lagrangian 
in the sense of an action functional for trajectories [23l [35]. It consists of a part 
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3.7 Averaging over trajectories 



corresponding to a heat released to the medium and a change in intrinsic entropy. 
From those quantities one can construct the functionals that quantify the entropy 
change in the system and medium, Ssys[w] and Smcd[w], 



Ssys[w] = Svis[w] + Sint[w], 
Smed[w] = SmotM " Sint [^ii] 



(42) 
(43) 
(44) 



The instantaneous rates of change of the entropies have previously been identified 



log 



CTint M 



fc 



Pui{t) k 
[(Sint(wfe) - Si„t(Wfc_i)) (5(r - Tk)] 



5{t - Tk) 



k 



(45) 
(46) 
(47) 



3. 7. Averaging over trajectories 

Strictly speaking, of the above defined entropic quantities only the intrinsic entropy is a 
state variable and therefore has a counter-part in classical thermodynamics. However, 
upon averaging over trajectories, we recover the thermodynamic interpretations of 
Markov processes originally used by Hill and later Schnakenberg P^Tl [5T| . We start 
with the visible entropy. As a local quantity, it reduces to an ensemble average which 
is the usual Shannon entropy of the (discrete) ensemble: 



*S'vis(''') ■ — {{Svis)) — (Svis) J- — 



logp! 



(48) 



Let us look at the time derivative of this entropy 

123 



Svis(T) 



dr 



B ^ 



p\ 'Wj log 

P 



(r) 
i 

(r) 



(49) 



Because the average is linear, we can also write it as the difference of the trajectory 
averages of the terms dtot and (Xmot, which can also be explicitly calculated. 



Svis(T) = ((o-tot» - ((c^mcd)) 



= fee 



E 

hi 

Stot(T) 



(t) i 

(r) ^ , Pi ^3 

Pi ^j log^^ 

p) wj 



hi 



p\'^^w) log 



dr). 



(50) 
(51) 
(52) 



Often, Svis and Emot are identified with the entropy production rate in the system and 
environment, respectively, which is only true if we agree to forget about the change in 
intrinsic entropy. However, in a steady state the average intrinsic entropy is constant 
and such an identification is correct again (though trivial, because Evis ~ 0). 

* Here, to avoid dealing with time derivatives of random variables, we do not denote them by s but 
rather by the symbol a. 
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3.8 Summary 



3. 8. Summary 

In this section we have reviewed the basic concepts of stochastic thermodynamics. The 
main resuh was the identification of the three fundamental entropy functionals Sjnt 
, Svis and Smot together with the functionals for the variations, Uint , cTvis and Umot- 
The crucial, yet somewhat uncommon observation is that the trajectory functionals 
can depend on the current ensemble, i.e. the solution of the ensemble evolution 
equation. Further, the functionals were shown to be consistent with older results 
on the thermodynamic interpretation of Markov processes. We deliberately skipped 
the application of the approach, like the derivation of fluctuation theorems, which can 
be found in the literature [531 1331 ISS] . 

4. Deterministic Thermostatted Dynamics 

In the last section we have outlined the steps of how to arrive from a Hamiltonian 
dynamics at trajectory-dependent functionals for entropic terms of stochastic 
thermodynamics. Stochasticity was introduced from the beginning to deal with the 
ignorance of certain aspects of the system which lie beyond the scope or resolution of 
our observations. The rest of this paper is dedicated to accomplishing the same using 
deterministic dynamics. 

In this section we will outline the same conceptual steps as in the previous section 
only in the deterministic framework: We will review thermodynamic consistency, 
reduction to overdamped dynamics and finally the coarse-graining to a mesoscopic 
description. 

General scheme 

Throughout this section, we will follow the systematic approach to deterministic 
thermostats presented in Ref. [30| . This general scheme resembles the formulation 
of a Langevin equation: One starts with an equation of motion for the degrees of 
freedom of the system under consideration. After that, a phenomenological drag term 
is added to model the infiuence of the environment. Instead of adding a stochastic 
noise term, one either promotes the drag coefficient to a dynamical variable or adds 
a deterministic "noise" term mimicking the fluctuations caused by the environment. 
As in the stochastic case, the choice of these terms has to be consistent with the 
thermodynamic properties of the bath. This is achieved by demanding that the 
stationary distribution of the physical degrees of freedom obtains a canonical 
form. 

^.2. Nose-Hoover thermostats 

We will exemplify the above method using the Nose-Hoover scheme, which is the 
deterministic analogue to the full Langevin equation Eq. (fTO| . The equations of 
motion are 




(53a) 



P = -dgV{q) - ^p, 
m 

1 = 9{q,p)- 



(53b) 



(53c) 
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4.3 Configurational thermostats 



Demanding that 



(xexp(-/3(y(q) + ^ 



P 

2m 



cf>(7) 



(54) 



with a quadratic term $(7) {i.e. Gaussian distribution of the values of 7) one arrives 
at 
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m 



dNksT 



(55) 
(56) 



Here, d is the dimension of physical space and Q =^ dNk^Tr^ is a constant related 

to the time scale Tp of the relaxation of the momenta. The discussion in the previous 

2 

section motivates Q = dNkBT^, where 7 is the phenomenological drag constant. If 
such a drag constant is not known, Q is a free parameter of the dynamics. 

Time averaging of equation Eq. (|56p in the sense of Eq. (O leads to the relation 



E 



m 



dNkBT. 



(57) 



It describes equipartition of the momenta coordinates at temperature T. Therefore, 
the dynamics of 7 ensures that the momenta relax to their equilibrium values 



4.3. Configurational thermostats 

Configurational thermostat are the deterministic analogue to the overdamped 
Smoluchowski equation (jlSp . It is assumed that the momenta have relaxed to their 
equilibrium values and the dynamics can be described by the equations 



kBTd'gV 



(58a) 
(58b) 



Again, the form of the dynamics of Jl is found from demanding the canonical form 
for the stationary distribution of the coordinates. Observe also how here we find the 
definition of Rugh's configurational temperature |27] appearing naturally in Eq. (|58bp . 
As above, the constant can be used to set a time-scale for relaxation of coordinates. 

The problem with this approach is that the dynamics is not ergodic. Mechanical 
equilibria dqV = act as attracting fixed points where the system comes to rest. To 
restore ergodicity, further modifications of the equations of motion are required. In 
the so-called SDC scheme this is done by introducing another dynamical variable, ^, 
that is used to shake the dynamics around mechanical equilibria, similar to what the 
stochastic noise term would do in the Langevin equation ^30] . The equations of motion 



are 



q = ^^^gV + C 



(59) 



(t As in the stochastic case, it is an assumption that the trajectories generated by the thermostatted 
equations of motion can describe the transient relaxation towards equilibrium, too. 
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4.4 Dynamic reversibility 



with the dynamics of fj, as in Eq. (|58bp and 



(60) 



with h leading to a dynamics that satisfies C ' dqV = 0. There are essentially 
three different possibilities to satisfy this condition which correspond to differently 
constrained fluctuations around mechanical equilibria. 

Actually, the full SDC scheme goes further and introduces a dynamical drag term 
for the coordinates, too. If this is done, the connection to statistical physics here is 
obtained similarly to the Nose-Hoover case, only that one uses equipartition for the 
virial ^ q ■ dqV{q) rather than for the mean kinetic energy. 

It has not been proven if or under which conditions the dynamics created by 
the SDC thermostatting scheme are ergodic. However, numerical simulations using 
deterministic SDC schemes as well as a variant [2^, the Braga- Travis (BT) scheme 
[1], indicate ergodicity. 

4-4- Dynamic reversibility 

A very important aspect of the SDC dynamics is their dynamic reversibility. Dynamic 
reversibility means that we can basically let the dynamics run backward in time if 
we apply a time-reversal operator I : F — > F to its microstates. For later reference 
we now formulate the general definition of reversibility for a dynamical system on 
phase space F governed by the flow $*, Eq. ([5]) [211 HSl [50] . Here, F is the extended 
configurational phase space with elements x — (q, a) consisting of the coordinates as 
well as the additional dynamical variables a — (/i, C, •■•)■ Now let (F,S, A) be the 
measure space of the thermostatted system with Lebesgue measure A on the Borel 
sets B. 

We call the dynamics reversible if and only if there is a mapping I such that 



In the case of the SDC scheme the time-reversal operator that fulfills the above 
properties is I{q,a) = {q,—a). For Hamiltonian dynamics, it is I{q,p) — {q,—p)- 
But there are also other kind of abstract dynamics, like multi-baker maps, where such 
a condition of reversibility holds (for a review c/. Ref . [35] ) • 

5. Dynamical Notions of Entropy and Information 

In this section we will use reversible deterministic (thermostatted) dynamics to derive 
analogues of the trajectory-dependent entropic expressions ()40II47|) using the concepts 
of information theory. Unlike the previous sections, which mostly reviewed existing 
results, the work in this section is original though the basic notions were inspired by 
works of VoUmer and co-workers and others [51 [351 ISHl IMj 

The outline is as follows: First we will partition the phase space of the system 
into disjoint mesoscopic cells Cuj- As in the stochastic case of section [31 these cells 
correspond to the possible observations (measurements) w G Our information 
of the system on the mesoscopic level is quantified by an evolving coarse-grained 



T oT = id (involution), 
\{I~'^A) = \[A), \/A e S(measure-preserving), 



(61) 
(62) 

(63) 
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5.1 Mathematical set-up 



density p. In parallel, the evolution of a microscopic density g is considered, which 
contains the full information about both state and history of the system. The 
(experimental) uncertainty of the state of the system can be expressed as the relative 
information or KuUback-Leibler divergence of the two densities. In order to understand 
the interpretation and evolution of the information-like quantities in the context of 
statistical physics, we need the evolution of the densities. Therefore, an analogue to 
the crucial assumption of equilibrated cells or local equilibrium needs to be formulated. 
Additionally, the fact that certain fast or auxiliary variables cannot be observed 
imposes a time-reversal-symmetry condition on the mesoscopic states. 

5.1. Mathematical set-up 

Starting point for our discussion is a reversible, deterministic dynamics on some space 
r. As a paradigm we can think of the flow generated by the thermostatted 
equations of motion l[59|) . 

In order to account for the finite time resolution of real experiments, we take a 
stroboscopic point of view on the dynamics. Mathematically, this means that from the 
continuous dynamics we obtain a deterministic, discrete map $ = <I>'^ 



by fixing a small time-step r = Tobs and only look at the microscopic state of the system 
at intervals of r. After that, we could let Tobs tend to zero to arrive at a continuous 
description. In terms of Markov processes we obtain a Markov jump process described 
by a Master equation (PT|) from a discrete time Markov chain. However, the physical 
assumption of separation of time-scales still applies, so one must not assume validity 
of the continuous description for too small time scales t ~ T,„ic. Henceforth, the index 
for the discrete time is denoted e Z corresponding to t — tv. 

5.1.1. Observahles and partitioning In order to connect to the statistical mechanics 
of mesoscopic systems we partition phase space into discrete cells Ci <Z T through 
observables of the form of Eq. (|20p . A crucial condition on the measurable observables 
M is that they are invariant under time- reversal, i.e. a. single measurement cannot 
distinguish the direction of time: 



Physically, this excludes unobservable phenomena from the definition of our 
mesoscopic states. Thinking of the SDC-thermostats this is the condition that the 
observables should only depend on the physical coordinates q and not on the auxiliary 
variables a. In the underdamped situation this means that observables shouldn't 
depend on the fast momenta. However, this condition also applies for the mesoscopic 
cells that constitute the multi-baker maps used by VoUmer [51 [35] . 

To continue, we will need the phase space volume H^ of phase space cell Ci, which 
will be its Lebesgue measure. If the (extended) phase space F has a finite measure, so 
will the cells. The real space with coordinates q is always considered to be confined 
to a finite volume, hence it has a finite measure. However, this does not need to 
be the case for the momenta or auxiliary variables. But because these variables are 



$: r ^ r. 



(64) 



M{Ix) ^M{x). 



(65) 
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5.1 Mathematical set-up 



unobservable, cells will always be direct products of a finite observable configurational 
part, Cobs.w and possible infinite parts Cunobs which do not depend on the value of ui. 

Therefore, when we write 11^ — \{Ci) we mean the measure projected onto the 
observable, configurational phase space. Formally, let {Tconf,B,X) be the measure 
space of the (extended) thermostatted system with Lebesgue measure projected on 
the Borel sets B of the finite, observable part Fconf . 

Having ensured finiteness of the measures, we can establish some further 
definitions: 

Let C™ C Cm be the set of all points that are mapped from cell m to cell n. 
Denote s™ its relative volume in cell ra. Similarly, let C C„ be the set of all points 
in cell n with pre-images in cell m and s™ its relative volume: 

C ■={xe Cm ■■ e Cn} (66a) 
n™ := A(C) (66b) 



n" 



(66c) 



(66f) 



C := {y e C„ : ^-^y G C,„} (66d) 
■■= A(0 (66e) 

~m n 

Because the cells form a disjoint partition one has 

n n 

5.1.2. Images and pre-images under reversible evolution We assume that the discrete 
dynamics is reversible with an Involution I in the sense of equations ([CTH55)) . Because 
of the constraint Eq. (p5|) the involution is necessarily local on cells, meaning that 
mesoscopic cells are invariant sets under the involution: 

IC„=C„,Vn. (68) 

With this relation and the notion of dynamical reversibility (pTI - 155)) it follows that 

C = (69) 

This reason for this is that 

5™ = ^ n;r ^ A(C) = A(C) ^ l^m (70) 
by definition and further one has 

C = {y-{ye C„) a {r'y g Cm)} 
^ IC - {^■■{z= ly) A (y e c„) a [r^y g Cm)} 

^ {z : {Iz G Cn) A {r^Iz G Cm)} 
^ {z : (z G ICn) A {ir^Iz G ICm)} 
'^z:(zGC„)A(0zGC„)} 

nn 

— '-'mJ 
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5.2 Information theory and entropy 



which means that the role of images and pre-images is exchanged under the reversed 
dynamics. Because the involution is measure preserving ()62p we find 

nr„ = A(c) = A(ic) = A(c) = n:r (71) 

which proofs Eq. (p^]) . 



5.2. Information theory and entropy 

To connect the deterministic dynamics on the cells to thermodynamics we define 
entropic quantities motivated in the context of Shannon's information theory 
We equip the space (F, B) with a family of measures ^^^^ that are parametrized by 
discrete time v. We assume that any member of the family /i^''^ has a density g^'^'^ [x) 
with respect to the Lebesgue measure A. We will denote g^^^ the fine-grained ( or 
microscopic) density. 

The central quantity of information theory is the information functional of a 
density g over an area of phase space C C F: 

S[g\C] := -kB I g{x) log g{x) dx (72) 
Jc 

The standard Shannon entropy is the information functional evaluated over the total 
phase space, S[g] = S[g\T]. Another quantity that will appear frequently is the 
relative information of density g' with respect to density g, which can be formulated 
as a KuUback-Leibler divergence: 

2?KL [q II q'] = kB [ g{x) log 44 > 0- (73) 
Jc Q \^) 

5.2.1. Coarse-grained density Additionally to the microscopic density, we define a 
coarse-grained (or mesoscopic) density pm"* to be the averaged microscopic density of 
a cell Cm'- 

Pt^-^ I Q^''\x)dx. (74) 

It resembles the fact that we can only measure up to a certain resolution that is defined 
by the mesoscopic observables M{x). That is, we can see a system in a mesostate w 
but never infer the actual microstate x from that information. 



5.2.2. Fine- and coarse-grained entropy For simplicity, in this paragraph we will 
suppress the superscript because we are not dealing with any evolution of the 
densities. We will refer to the constrained Shannon entropy of g on cell Cm, 



S [g I Cm] = -fee / q{x) log g{x) dx (75) 

= -kB / g{x)\og(nmgix)) dx -i- kB / ^(a;) log n„ da; (76) 

JC„, JCra 

= -ks g{x)\0g{nmg{x)) dx -\-IlmPmSint{m), (77) 
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5.3 Entropy production and variation 



as the fine-grained entropy.ln the last Hne we introduced the quantity 

Sint{m) ~ fee log n„ , (78) 

which — in analogy to section[3] — we also call the intrinsic entropy of state m. However, 
here it is defined in the sense of Boltzmann as the logarithm of a phase space volume 
rather than in the sense of Gibbs or Shannon. This identification will be justified also 
by the final results. 

We also introduce the coarse-grained entropy of cell m using the coarse-grained 



density p{x) = Y.,n PmXc^ix)- 

S [p I Cm] = -kBUmPm log Prn (79) 

= -fcellmPm log (TlmPm) + kB^mPm log Ilm (80) 

= -kBUraPm log {TlmPrn) + n,„p„Sint(TO) (81) 

= n„iP„(Svis("^) + Sint(TO)) (82) 

= n,„p„i(scg(m)) (83) 



The last line splits the microscopic density into two local contributions that are 
weighted by the factor IlmPm- The first part is the intrinsic entropy from above, 
which is a thermodynamic state variable but not directly observable. The second part 
is a local quantity, too, but it additionally depends on the ensemble density p. We 
call this part the apparent or visible entropy, 

Svis(nT-) = -fee log (n„ip„i) , (84) 
because this quantity can be inferred from experimentally obtained ensemble statistics. 

5.2.3. Relative information and observable entropy Another important quantity is 
the difference between the coarse- and fine-grained entropy S [p \ Cm] ~'S[q \ Cm] , which 
was introduced in Ref. [5] in the context of multi-baker maps. It can be written as a 
KuUback-Leibler divergence and is therefore always positive 

S [p I Cm] -S[q\ Cm] = -DkL [g\\p\ Cm] = f g{x) log ^ dx > 0. (85) 

From information theory we know that this quantity is the relative information of the 
two descriptions, i.e. it quantifies the uncertainty of the microstate of the system if we 
know the mesostate. As the system equilibrates from its initially prepared state, this 
uncertainty becomes larger and therefore this quantity should never decrease. Later, 
we will prove this and identify its rate of change with the irreversible, total entropy 
production rate Stot. 

5. 3. Entropy production and variation 

Under the assumptions of stochastic thermodynamics it is possible to obtain the 
full-time evolution of the microscopic density, if it was initialized uniformly on the 
mesoscopic cells. The idea is to follow the contraction and squeezing of subsets of 
phase space that move along the same mesoscopic trajectory. 
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5.3 Entropy production and variation 



5.3.1. Discrete trajectories, co-moving microstates and equilibrated cells In the 
stroboscopic picture the cvohition operator is a discrete map that generates discrete 
mesoscopic trajectories. Therefore the mathematical treatment becomes more lucid 
and averages and probabilities can be explicitly calculated. Analogously to the time- 
continuous phase, we denote a semi-infinite mesoscopic trajectory by uj. If the 
trajectory is constrained to run until time t = i/Tobs it is denoted by It can 

be represented hy a, {u + l)-tuple: 



Where no ambiguity can arise we will write simply cj instead of w(i^). 
Further, we define the mesoscopic history of a point x G C^,^, 

ui{x) := {M{<!>-''x),M{<p-''+^x),...,M{<p-^x),M{x)) 

and the set of microstates that share the same mesoscopic history cv 

C M •= '■ '^{x) = uj}. 

This set can also be recursively defined: 



(86) 



(87) 



(88) 



(89) 



The quantity we are interested in is the volume 11 [w] := A (C [w]). To calculate it, we 
must formulate the assumption of equilibrated cells for deterministic dynamics. First 
recall its meaning in the context of time scales: It is the assumption (or better said, 
approximation) that any set of trajectories entering a cell Cm at time t is smeared 
out over the whole cell after a time-scale r < Tobs- Consequently, the set C [w^*^)] of 
microstates that arrived in cell C^^ C C [uA^^ at time k must cover Ci^^ densely. But 
this is equivalent to saying that the measure of the subset of C [iJ'''^] that will be 
mapped to Cu;^_^_i relative to 11 [oA''^ must be the overall fraction s^*_^j : 



A(C[a;('=)]) 



(90) 



Note that this is just the Markov assumption, if we interpret 11 [w^*^^] and s"'^^^ 
as the probabilities of observing the trajectory u/^''^ and a jump from to 
respectively: 



Prob 



Prob 



X Prob [wfe+i I ujk] 



(91) 



Further, define the contraction factor 



Ilwfc+lSwfc 



(92) 



Because the sets of co-moving trajectories are smeared out over the whole cell, we 
assume uniform contraction, i.e. 



A 



(fc-i) 



n 



,ik-l) 



■0- 



(93) 
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5.3 Entropy production and variation 



Using equations ([5^ . (I5D|) and the definition ([^ we find a recursive relation 
for n[w]: 



n 



After iteration we arrive at 



TT 



,(fc-i) 



n J"'^ 



n 



n 

k=l 



n 



,(fe-l) 



fe=l 



n.. n 

He., n ''^^i-. 



(94) 

(95) 
(96) 
(97) 
(98) 



fc=i 



wliere in tlie last line we used reversibility. 

Here we stress again, that for any real dynamics, the above assumptions have to 
be understood as an approximation in the same spirit as the Markovian approximation 
of the stochastic approach. However, dynamics that exactly fulfill Eq. (|98p can be 
constructed. The most well-known thereof are the reversible, "properly thermostatted" 
Multi-Baker maps used by VoUmer and co-workers [SI [351 [35] ■ In that particular case, 
there is no separation of time-scales or any notion of "equilibrated cells". The subtle 
(geometrical) reason why one can use Eq. ([55]) is that the images and pre-images 
of the map intersect in angles of 90°. Therefore, Multi-Baker type maps might be 
useful as models to study this connection. However, they do not represent (small) 
thermodynamic systems. 

5.3.2. Evolution of the densities On its way to cell Wi, along trajectory co the volume 
element C [uj] has been repeatedly stretched and squeezed. Denote by p^*^^ [w] the 
density of the measure /i'^'^^ in C [uj]. Because we start with uniform densities on the 
cells at time 1^ = 0, this is a constant. Probability conservation implies that 



w]H, 



This leads to 



After iteration we find the density at a point x £ H^^,^ with a trajectory ui{x): 



A)(:r)=p('')y(x)]=p.„n 



fc=i 



H, 
H. 



(99) 
(100) 

(101) 



The evolution of the coarse-grained, mesoscopic density p^"^ is obtained by integration 
over cell H„. ,: 



r 



(102) 



k=l 
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5.3 Entropy production and variation 



Here, ^™ means that we sum over all trajectories that end in cell Cm- The weight 
Pm. of cell Cm with respect to the measure /i^'') is 

:=AiW(C„)=n„pW. (103) 

It is easy to see that Eq. p02p is the solution of the discrete-time Master equation 



sip) 



(104) 



under the assumptions that we found an initially uniform density on each cell, 
i.e.p^ = IlmPm- In Eq- (|104p the take the role of transition probabilities. 
Now we define the weight of a mesoscopic trajectory uj 



fc=i 



(105) 



It obeys 



^py = i, (106) 

Inserting Eq. (jlOip into the definition of the fine-grained entropy Eq. (|75|) we find 



S 



k=l 



\k=l 



log- 



-t-log (n^oP^-^o) 



+ n„„p2C^Sint('^i/)- 



(108) 
(109) 



Also here we define B) := fee log ~ using the volume fractions sj. In this context it 
can be interpreted as the rate of phase space contraction [351. A more suitable form 



is: 



S 



''''McJ=-E^ 



py E^; 



k=l 



(110) 

(111) 



The first term is the entropy originating from the jumps of the trajectories that end 
in Iltj^ at time v. The second term is the visible entropy specified by the initial 
conditions. The third term is the part of the intrinsic entropy associated with cell w^. 



5.3.3. Recovering the trajectory junctionals of ST At this time it is worthwhile to stop 
and reflect whether it makes sense to attempt an interpretation of Eq. (jllip as the 
"fine-grained entropy of cell uj^ at time The problem is that such an interpretation 
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5.3 Entropy production and variation 



is local in space and time. However, even though we could eliminate the microscopic 
density, the first term now depends on the history of many (non-local) mesoscopic 
trajectories. Therefore it is impossible to describe this term without referring to the 
other cells or the memory of the process. 

Let us recall the functionals (|55H¥T|l from section[31 With the modified definitions 
of the intrinsic entropy, the weight of a cell Pm and the contraction rate Bj , the discrete 
analogues look formally the same: 



Sint[w, J^] = Si„t (uJi,) 



fee log n^^,, , 

-fcB logpirj. 



Smot [w, ly] 



i=i 



(112) 
(113) 

(114) 



With this we can express the sum of the fine-grained entropy of all cells with the help 
of trajectory averages 



E 

CJf) 

((Smot)) 



k=l 

pL^iogpW 

(0) 



E 



(115) 

(116) 
(117) 
(118) 



The first and the last term of the last line are trajectory averages of the form of Eq. 
though the latter could also be expressed by a trajectory average of a local quantity 
using Eq. (|30p . The second term is the visible entropy of the initial ensemble. 
The coarse-grained density of all cells reads: 



E 



= ((Svis)) + ((Sint)) 

For the difference entropy S^d we find 

^rc\ ^ (Svis)^''^ - (Svis)^*''' + ((Smot)) = Sl'^J - Sl^2 + (('^mot)) 



(119) 
(120) 

(121) 
(122) 

(123) 



The main result is that these identifications allow us to relate the physical 
entropies to the (Shannon-)entropies in the context of deterministic dynamics: 
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(124) 
(125) 
(126) 
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5.4 Connection to the continuous time description 



Because of the one-to-one correspondence, the relation to the three fundamental 
functional ()42ll44p hold as well. 

It remains to find the analogues of the entropy variations o . In the discrete-time 
case, this is easy, as cr[a;, i^] can be obtained as s[a;, i^] — s\o_^v — \\ for any discrete 
entropy functional s: 



CTvis[w, v\ = fee log 



P. 



0-int[w, v\ = Sint(w^) " Sint(w,._l), 



(127) 

(128) 
(129) 



Again, from these quantities one can construct the functionals of the physical 
entropy variations per unit time as well as their trajectory averages. A surprise in the 
discrete case may be the result for the average total entropy production per unit time 

Stnt — ((C- 



fmot/ 



2J 



Pi Si log — - 



^,3 L 



PtS 



Pjsi 



PiS) log 



L I i 



> 0. 



(130) 
(131) 
(132) 



The average total entropy production is, as expected, always positive. This can be 
seen from the second line where we write it as a KuUback-Leibler divergence. However, 
looking at the third line we see that it differs from the direct correspondence to the 
continuous case by a boundary term. Also these two terms have the form of KuUback- 
Leibler divergences. The negative second term becomes increasingly smaller as a 
steady state is reached. It has often been ignored by previous works because it does 
not appear neither in the continuous case nor in the steady state. 

In summary, we constructed the discrete versions of the fundamental functionals 
used in stochastic thermodynamics from deterministic dynamics. 



5.4- Connection to the continuous time description 

To arrive at the discrete-time analogues of the trajectory functionals one basically just 
has to substitute by in the continuous-time expressions (|4(JH47p . The variations 
(T were obtained as differences of the entropies. However, they can also be obtained 
from the continuous variations cr"^, which we will denote by the superscript c, if we 
consider a given continuous trajectory ur. 

Let us consider the time interval [i^Tobs, ('^+ l)''obs]- We assume that Tobs is small 
enough that at most one jump occurs at tj from states uj- := uj {{v — l)Tobs) = Wj^-i 
to 0J+ := oj {PTohs) = Wiy. An entropy functional s can be written as the integral over 
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5.5 Connection to previous works 



the continuous- time derivative functional: 



a =s[a;; [v + l)Tobs] - s[u, VTohs 

»(i^+l)rob3 

»(i^+l)rob: 



(133) 

/ (T"[a;(0]dt + / cr"[a;(i)]dt 



In the last line we split the integral into two contributions. This is important in order 
to evaluate the ensemble probability for the correct states. The first line implies that 
in the discrete case S is the average variation per unit time of the corresponding 
average value of S. 

For completeness, we mention that the continuous time description is usually 
obtained in a more formal way. This is done by converting the expressions in the 
discrete-time Master equation ()104p into its continuous-time analogue, Eq. ([21]) , in a 
limit procedure Tobs 0. The jump probabilities s^- for time spans Tobs are interpreted 
as generated by the infinitesimal generator {W)ij = (wj) [13]: 



5.5. Connection to previous works 

For consistency, we partly introduced new notation for different entropic quantities. 
Random variables are always denoted by minuscules, with their (trajectory) averages 
represented by the corresponding upper case letters. Further S denotes the time 
derivative or variation per unit time of its entropy S, in the continuous or discrete 
case, respectively. 

As a summary, we discuss the main entropic terms identified in this work again in 
detail and give an (incomplete) overview of the notation and interpretation in previous 
work. 



Visible entropy The visible entropy Svis was defined in Eq. as the Shannon 

entropy of the (discrete) ensemble. In the same spirit it is the observable part of the 
coarse-grained entropy, cf. Eq. (1120p . It can also be expressed as the average over 
the entropy functional Svis- 

It has been often simply called entropy S (in the context of Markov chains) 
[5T1 [TBI [31 [Hj- However, Lebowitz and Spohn have called it also Gibbs entropy Sq 
because of its functional form and to distinguish it from Boltzmann entropy. 



Intrinsic entropy The intrinsic entropy Sint(w) is the unobservable (local) equilibrium 
entropy of a mesoscopic state w. It can be calculated canonically from the constrained 
microscopic density as in Eq. ([^ . On the phase space of the thermostatted, 
deterministic system it is calculated in the sense of Boltzmann as the logarithm of 
the volume of a cell ([75]) [311 [3S|. In this spirit, is has also been called Boltzmann 
entropy Sb{M) of (mesoscopic) state M [21]. 
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5.6 Application: Isolated systems 



Fine-grained entropy The coarse-grained entropy S'fg is the Shannon entropy ()115p 
of the microscopic (Liouville) density g. For equilibrium systems, it is equivalent to 
Gibbs' entropy. Maes and Netocny denote it ^(/i) and make a strong point for not 
calling it Gibbs entropy [24j . which is the expression used by VoUmer and co-workers, 
who denoted it S'(^) 

Coarse-grained entropy The coarse-grained entropy is the Shannon entropy of 
the coarse-grained density p from Eq. (|119p . It consists of the visible entropy S^is 
and the intrinsic entropy of the current ensemble Sint = {s-mt)- VoUmer and co-workers 
call it simply the entropy S [51 [3S] , whereas Maes and Netocny denote it the Shannon 
entropy of the (projected, z.e. coarse-grained) measure, S{p) [M] . 

Total entropy production The total positive entropy production Etot is the quantity 
to be identified with the total dissipation [321 [31] . It is always positive due to its form 
of a KuUback-Leibler divergence. In the context of information theory it is the rate 
of increase of the uncertainty of the microscopic configuration over time, if one keeps 
track of the mesoscopic densities. 

It has previously also been called the internal non-negative entropy production 
[311 HI 131 in]- Notation has varied a lot including the symbols P [21], R [H], iA[ 
[TB] , [3] , [H] and 5**°* [33] ■ Usually, it has been considered for continuous-time 
Markov chains where it has the form of the first term in Eq. (|52p . However, as we 
saw above, for the discrete situation an additional boundary term from the finite time 
interval appears with respect to the continuous-time formulation. This was already 
noted by Gaspard [16], though his identification and interpretation is different than 
in the present context. 

Motance entropy variation The variation of the motance entropy Smot is facilitated 
by the local motance Bj. As we see from the deterministic discussion (|1181 the 
motance 5'niot is the main part of the fine-grained entropy and directly related to 
phase space contraction. Thermodynamically, it is the heat released to the medium 
less the variation of the intrinsic entropy [311 [33] . 

The latter contribution has often been neglected when Smot has meed identified 
with the entropy change in the reservoirs describing the medium of the environment 
[Sn . In the same sense it is often called the entropy flux to the medium or external 
entropy variation [161 13]. As for the case of the total entropy production, symbols 
used include P2 [31], ^ [22], -^K [H], [3], -S'e = -5^ [9 and S"°* [33J. 

5.6. Application: Isolated systems 

As Seifert stresses [31], it is crucial that the motance consists of the variation of 
intrinsic entropy as well as the entropy flux to the medium. Here, we show this 
explicitly using an isolated system evolving under Hamiltonian dynamics. We know 
that the invariant measure will be uniform on the constrained phase space (energy 
shell), corresponding to an equilibrium situation. Further, detailed balance should 
hold between mesoscopic cells X{Ci) — Hi. Without loss of generality we set the total 
phase space volume to unity, i.e.X{T) — J^i^i — 1- Then, because g°° — 1, we find 
that the equilibrium probabilities obey — Hi . From the detailed balance condition 

prs]=PT4 (135) 
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we then infer that Sj = CijUj with some constant Cy = Cji. But because J2j ^) — ^ 
for all i, one has to demand Cji ~ 1, Vi, j. 

By definition, an isolated system cannot allow any "entropy flux to the medium" 
at any time, i.e. 

((<7med)) = ((f^mot)) ' ((f^int)) = 0. (136) 

But this can be seen from 



XCmed 



)) = E 



Vis\ log 

si 



E Pi) log Hi] 

i 

XI IPi^j log - piUj log Ui - pjUi log Hi] 
+ ^[Piiogni] 

i 

J2 i-Pi^j logHi] +Y,\pi log Hi] 

«,i i 

5^ [(-Pi + Pi) log Hi)] = 0. 



6. Conclusion 



In this work we presented a new approach to the entropy functionals of stochastic 
thermodynamics using deterministic evolution equations. We used reversible 
deterministic thermostatted equations of motion as a paradigm. The finite resolution 
of any real measurement motivated the introduction of time-reversal invariant 
mesoscopic observables and the use of a stroboscopic picture. Under the assumptions 
of equilibrated cells we could follow the evolution of an observable coarse-grained 
and an unobservablc fine-grained density. This lead to the notion of coarse- and 
fine-grained entropy, whose dynamics is intimately related to the notion of entropy 
and entropy production in the system and its environment. Finally, we were able to 
construct the trajectory-dependent functionals of stochastic thermodynamics from a 
purely deterministic dynamics. We also made the connection of the discrete-time case 
to the more common continuous-time description. 

This work also hopes to disentangle some of the confusion in the context of 
nonequilibrium entropies. Maybe it can contribute to reconcile different views on 
entropy and entropy production through the one-to-one correspondence of phase space 
contraction and thermodynamic interpretation of (ratios) of transition probabilities. 

Further, the special role of Multi-Baker-Maps in the context of nonequilibrium 
thermodynamics was analysed: Rather than as simplified models for chaotic, many- 
particle dynamical systems they have to be understood as mathematical models that 
show a similar evolution, due to a geometric peculiarity. However, they might elucidate 
the connection between dynamical systems and stochastic thermodynamics further 
and future work in this direction is in progress. 
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